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Method for reducing the amount of needed memory in a TDM 
switch system 

of the invention 



The present invention is related to data nodes in 
communication networks, in particular to transmitting 
independent serial data streams through synchronous Time 
Division Multiplexing (TDM) switches. 

Background of the invention 

The lower layer of communication networks like the 
connectivity layer in a core network of a cellular 
environment could be seen as a layer of distributed 
resources for managing data flows. Switches and 
multiplexers are some of the main components for this 
purpose. In complex communication networks managing data of 
different formats and varying data rates, it is of great 
importance that the construction of the components are 
flexible without being too complex. 



Conventionally, the switches comprise a number of serial 
inputs and outputs. The data stream of one input may be 

20 directed in its entirety to a certain output line, or it 
may consist of a mixture of time division multiplexed data 
frames that are to be distributed to several outputs. The 
different lines may be running various interfaces e.g. El, 
E2, E3 and STM-1 (figure 1). Additionally, the data speed 

25 of each input line may vary in a wide range. The transition 
of data frames in the switches is often executed by means 
of time slot buses located on the back plane of the 
switches. 



A variant of such a switch is illustrated in figure 2. In 
30 this example switch, there are up to 32 serial input lines 
and 32 serial output lines. The possible data speed of each 
line should vary from 8kbit/s up to 45Mbit/s. The 
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transmission of data is executed on one or more fast TDM 
buses, transporting the data frames on time slots 
preferably with a minimum delay. 

A problem occurring in switches handling various line 
interfaces is that each line will need a memory for 
temporary storage of frames before and after the TDM buses. 
The higher data rate on each line, the higher the 
requirement for the storage capacity. 

A synchronous digital TDM switch as described in [1] 
typically needs to store more than two frames of data on 
each line and in both directions (RX and TX) . If such a 
digital switch shall be a general switch dimensioned for 
any data rates on any lines (i.e. from 64 kbit /s and up to 
45Mbit/s), a large amount of memory could be necessary 
unless some restrictions on how the memory access and 
allocation are established. 

The straight forward way to transfer data coming from 
several data lines over a TDM bus is to allocate memory 
according to the maximum speed allowed on each line. If the 
maximum speed on each line is N bytes /frame, and the number 
of lines are L, then the required amount of memory needed 
is at least 4 x N x L bytes (need to store at least 4 
frames of data) . 

US 6,052,448 uses a method for reducing the amount of 
needed memory, but this is not done by dynamically 
allocation of the memory. If the maximum allowed 
transmission rate for each line is high, a large amount of 
memory is needed. Delays through the node will probably 
also be too high, often more than two TDM frames (250us) . 

Summary of the invention 

It is an object of the present invention to provide an 
arrangement that eliminates the drawbacks described above. 
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The features defined in the claims enclosed characterize 
this method. 

Brief description of the drawings 

In order to make the invention more readily understandable; 
5 the discussion that follows will refer to the accompanying 
drawings . 

Fig. 1 is a block diagram illustrating an example of serial 
output/input lines with variable interfaces connected to a 
TDM bus, e.g. in a switch, 

10 Fig. 2 shows an example switch including its output and 
input ports and a TDM bus, 

Fig. 3 is closer view of the example switch of fig. 2 
according to the present invention, 

Fig. 4 illustrates a buffer and its allocating pointers 
is according to the present invention, 

Fig. 5 shows the three different buffers on the receiving 
side according to an embodiment of the present invention, 
including connection tables and their interaction with the 
TDM buses, 

20 Fig. 6 illustrates the timer parameters according to the 
present invention in relation with a frame, 

Fig. 7 illustrates an example of how the timers according 
to the present invention may control the time at which the 
bytes of a frame is to be transferred, 

25 Fig. 8 shows the three different buffers on the 

transmitting side according to an embodiment of the present 
invention including the connection table, and their 
interaction with the TDM buses. 
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Detailed description of preferred embodiments 

The present invention describes a method for reducing the 
amount of memory needed when transferring data coming from 
several data lines over a common TDM bus. It also makes it 
possible to obtain very short data delays through the node. 

According to the present invention, by setting a 
restriction on the total bandwidth possible to transmit 
through the node (which is less than the maximum bandwidth 
for one line multiplied by the number of lines) and also 
introducing dynamically allocable memories, schedulers and 
FIFOs (illustrated in figure 3) it is possible to reduce 
the amount of needed memory. 

The FIFOs have two main functions. First, they take up 
jitter and wander variations between the line data rates 
and an internal master synchronisation signal. Second, 
since not every data line can access the data memories at 
the same time, FIFOs must be implemented on each line. 

On the transmitting side, the data is stored in FIFOs 
before it is clocked out with the respective serial clock. 
Either an internal DPLL or external PLL generates the 
serial clock. The FIFO is by default (and maximum) 64 bits 
deep, but can be configured by writing to a register. 

On the receiving side, the data is clocked into a FIFO by 
the serial clock before it is transmitted in one burst of 
data per frame against the scheduler. The FIFO is by 
default (and maximum) 64 bits deep, but can be configured 
by writing to a register. 

The schedulers are implemented to give all the lines access 
to the data memories on a regular basis. It is possible to 
program exactly at which time the scheduler shall request 
access to the memory for any ports. 
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By configuring the TDM switch in a so-called "MINIMUMJDELAY 
mode" and the scheduler in a special way, very short delays 
(mainly FIFO delays) through the node (32us for a 2Mbit/s 
link and 64 bit deep FIFOs) may be achieved. 

s The data memory or buffers is dynamically allocated by 

software. Each line can get the number of memory locations 
allocated that is requiredto store the amount of data for 
the actual bandwidth on the line. 

Conventionally, there is one buffer at the receiving side 
for each input line. However, according to the present 
invention, there is instead one or maximum two buffers for 
each TDM bus that is being shared between the input lines. 
This module is responsible for storing data coming from the 
scheduler, and put it onto the back plane bus in the 
correct timeslots. 

In one embodiment of the invention, there are three buses 
in the back plane, and the data is distributed among them. 
There are 8192 timeslots available on each bus (66MHz), 
which makes 24576 timeslots all together. There will be one 
set of memory and so-called connection tables for each bus. 
The data coming from the scheduler will go to all the 
memory blocks, while the connection tables are configured 
by software. This means that redundancy on the back plane 
may be achieved by making two of the connection tables 
identical. In this way two of the buses are holding the 
same data. The concept of connection tables will be 
explained later. 

The physical size of the memory blocks may be e.g. 2048 
bytes, which means that the system can handle a maximum of 
2048 different bytes per frame. The memories are configured 
in advance with a pointer value for every active port. 

The pointer value holds the address where the first byte in 
a frame is supposed to be stored. This address is loaded 
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into a local pointer at the start of a new frame, and the 
pointer is incremented after data has been written to this 
address. The memory can be shared by 32 different ports, 
but can also be occupied by only one. This concept is shown 

5 in figure 4. The buffers with their connection tables and 
TDM buses are illustrated in fig. 5. In this embodiment, 
there are three connection tables with 8192 entries each. 
Each entry has one address field and one control field. The 
address field points to the location in memory where the 

10 data can be found, while the control field holds 

information about e.g. if the current timeslot is enabled, 
if it is a minimum delay timeslot etc. 

L 

When reading data from the memories to the TDM bus, a 
timeslot counter is used to index the connection tables. 
15 The counter is synchronized to the bus clock, and is 

incremented along with the timeslots in the back plane. The 
back plane bus can transfer three different bytes at a 
time, so the current connection table entry will relate to 
timeslots x, x+1, and x+2 in parallel. 

20 The main function of the scheduler at the receiving side is 
to transfer data from the FIFOS into the buffers. In that 
respect, the scheduler uses a round- robin scheme on the 
transfer request register to check which ports requesting 
to transfer data. 

25 Information about which ports are active is stored in the 
RXSC_ACTP register. Only active ports will be considered in 
the round-robin scheme. The port with the smallest port 
number is served first. 

When a port has been served (a byte data has been 
30 transferred to the buffer) , the request bit in the 

RXSC_TREQ register is cleared, and the scheduler tells the 
buffer and the FIFO that the transaction is completed. 
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Every time the scheduler receives a frame strobe , the 
timers for the active ports will automatically set their 
associated RXSC_TREQ-register-bit to *1' . This will force 
the scheduler to transfer all the headers before it begins 
to transfer data. 

When the scheduler transfers data from a port to the 
buffer, it also sends the port address of the port 
currently transferring data. The port address is a number 
between 1 and 32, representing the number of the port. 

According to the present invention, the scheduler uses 
timers, one for each line. These timers are individually 
programmable, and defines when a byte from the memory 
should be transferred to a serial line (TX direction) , or 
when a byte from the lines should be transferred to the 
memory (RX direction) . 

When several lines request a data-transfer at the same 
time, a round-robin scheduler algorithm is used to serve 
the requests. 

There are 32 timers in the scheduler, one for each port. 
These timers are individually programmable, and defines 
when a byte from a serial port are to be written into a 
buffer. 

According to a preferred embodiment of the invention, four 
registers control each timer. These are called RX_NUMBER, 
RXJDFFSET, RX_DISTANCE and RX_DISTANCE_RESOLUTION . 

RX_NUMBER is defined as the number of bits (excluding the 
header) to be transferred from a port to the buffer during 
one frame. This number is a function of the bit rate of the 
port (PORT__BIT_RATE) . A port running at . a 45Mb/s bit rate, 
will need to transfer 5625 bit per frame; while a port 
running at a 8Mb/s bit rate, will need to transfer 1024 bit 
per frame. 
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NUMBER = 1 25/zy • PORT _ BIT _ RATE 

RX_OFFSET is defined as the number of clock cycles from 
frame start to the first transfer. This number is a 
function of RX_DI STANCE, RX_NUMBER and the distance from 
the last transferred byte to the end of the frame 
(EOFJDIST) . The minimum number for RXJDFFSET is 10, but 
must be at least 3 + (2 x (number of active ports)). Note 
that RXJDFFSET is the transfer of the first byte after the 
header, unlike in the scheduler at the transmitting side, 
where TX_OFFSET is the header transfer, 

RX_D I STANCE is the number of clock cycles of the system 
clock, between each transfer. This number is a function of 
the OFFSET, the NUMBER and the EOFJDIST. 



The RX_DISTANCE_RESOLUTION holds a six-bit resolution for 
the distance. A higher resolution than 1 clock cycle is 
needed for high-speed ports. As an example the distance for 
a 45Mbit port must be 22,625 and not 22 or 23. This is 
accomplished by setting RX_DI STANCE to 22 and 
RX_DISTANCE_RE SOLUTION to 0,625. 

The size of the FIFO will preferably be affected by these 
parameters, and vice versa. If RX_OFFSET or EOFJDIST are 
small, the RX_DISTANCE will be large and the FIFO size will 
be small; but if RX_OFFSET and/or EOF_DIST are large, the 
RX_DI STANCE will be small and the FIFO size will be large. 
Figure 6 shows the parameters related to a frame. 

With these four parameters it is possible to control 
exactly when the data is to be transferred from a port to 
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the buffer. Figure 7 shows an example where 3 bytes are 
transferred at timeslot 100,110 and 120. 

Each timer sets a bit in a transfer request register 
(RXSCJTREQ) , telling the scheduler that the port is ready 
to transfer data. 

Until now, only the receiving side of the switch is 
described. However, there is a similar memory/scheduler 
concept on the transmitting side of the switch. The buffers 
on the transmitting side and the corresponding connection 
table are shown in figure 8. These modules are responsible 
for storing data coming from the back plane bus, and pass 
it on to the scheduler by request. 

As already mentioned, in this embodiment, there are three 
buses in the back plane, and each of them can have 8192 
timeslots (66 MHz) . This makes 24576 timeslots all 
together. The data coming from the back plane will go to 
one large memory, and a connection table then addresses 
this memory. 

The connection table contains addresses and control bits. 
The addresses are used to point to locations in the large 
memory, while the control bits can be used to set a minimum 
delay. 

The size of the connection table is set to be 2048 bytes, 
which means that the system can handle a maximum of 2048 
different bytes per frame. The connection table is 
configured in advance with a size parameter and a pointer 
value for every active port. The pointer value holds the 
address where the first memory address can be found. This 
pointer value is loaded into a local pointer at the start 
of a new frame, and the pointer is incremented after data 
has been fetched from the location the pointer is 
addressing. The connection table can be shared by 32 
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different ports, but can also be occupied by only one. The 
size parameter controls the connection table allocation . 

A timeslot counter is used to index the large memory when 
fetching data from the back plane bus. Since there are 
5 three buses, three data bytes have to be fetched in 
parallel. 

There is also a scheduler at the transmitting side from the 
buffers to the FIFOs at the transmitting side. This 
scheduler also uses a round-robin scheme on the transfer 
io request register to check which ports requesting to 
transfer data. 

There are 32 timers in the scheduler, one for each port. 
These timers are individually programmable, and defines 
when a byte from a buffer should be transferred to a serial 
15 port. 

If several ports request a data-transfer at the same time, 
a round-robin scheduler algorithm is used to serve the 
requests. 

The scheduler uses a round-robin scheme on the transfer 
20 request register to check which ports want to receive data. 
Information about which ports are active is stored in the 
TXSC_ACTP register. 

Only active ports will be considered in the round-robin 
scheme. The port with the smallest port number is served 
25 first. 

When a port has been served (a byte data has been 
transferred to the buffer of the port), the request bit in 
the TXSCJTREQ register is cleared. 

When the scheduler is ready to transfer data from the 
30 buffer it sets up the port address for the port that wants 
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to receive data, and sends this address to the buffer. The 
port address is a number between 1 and 32, corresponding to 
the number of ports. The scheduler also sends a read signal 
to the buffer to tell it when it should write out the data 
on the 8 -bit bus. 

There are 32 timers in the scheduler, one for each port. 
These timers are individually programmable, and defines 
when a byte from a serial port are to be written into a 
buffer. The timing in the scheduler is not dependent on the 
bus frequency. It is only dependent on the system clock. 

According to a preferred embodiment of the invention, four 
registers control each timer. These are called TX_NUMBER, 
TXJDFFSET, TX_DI STANCE and TX_DISTANCE_RESOLUTION . 

When the timer reaches the TXJDFFSET value, the timer will 
automatically set its associated TXSC_TREQ-register-bit to 
x l' . This will force the scheduler to transfer the header 
before it begins to transfer data. 

When the scheduler has transferred data to a port, it 
clears the TXSC_TREQ-register-bit for this port and tells 
the FIFO that the transaction is completed- 

TX_NUMBER is defined as the number of bits (excluding the 
header) to be transferred from a buffer to a port during 
one frame. This number is a function of the bit rate of the 
port (PORT_BIT_RATE) . A port running at a 45Mb/s bit rate, 
will need to transfer 5625 bit per frame; while a port 
running at a 8Mb/s bit rate, will need to transfer 1024 bit 
per frame. 

NUMBER = \25jus-PORT_BIT_RATE 

TX_OFFSET is the number of cycles from a frame start to the 
first transfer. This number is a function of TX_DI STANCE, 
TX NUMBER and the distance from the last transferred byte 
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to the end of the frame (EOF_DIST) . TXJDFFSET is also 
dependent on whether the port is running in a minimum delay 
mode or a constant delay mode. In the case of minimum delay 
the minimum number for TX_OFFSET is a product of the actual 
delay from the receiving to the transmitting scheduler, but 
for regular mode the OFFSET can be set to minimum one. 

Note that TXJDFFSET is the header transfer, unlike in the 
receiving side, where RX_OFFSET is the first byte after the 



10 In scheduler-loop mode the TX_0FFSET is the first byte 

after the header and need to be set up with the same value 
as the RX_OFFSET register. 

It is preferred that all TX_OFFSETS in a minimum delay are 
set to different values. It is also recommended that the 
is data for a port be spread out equally across the frame to 
avoid under- or overflow in the FIFOs. 

TXJDI STANCE is the number of clock cycles of the system 
clock, between each transfer. This number is a function of 
TX OFFSET, TX NUMBER and the EOF_DIST. 



TX DISTANCE_RESOLUTION holds a six-bit resolution for the 
distance. A higher resolution than 1 clock cycle is needed 
for high-speed ports. As an example the distance for a 45 
25 Mbit port must be 22,625 and not 22 or 23. This is 
accomplished by setting TX__DI STANCE to 22 and 
TX_DISTANCE_RESOLUTION to 0,625. 

The size of the FIFO in the TX FRAME module will be 
affected by these parameters and vice versa. If OFFSET or 
30 EOF DIST are small, the DISTANCE will be large and the FIFO 
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size will be small; but if OFFSET and/or EOF__DIST are 
large, the DISTANCE will be small and the FIFO size will be 
large. 

With these four parameters it is possible to control 
5 exactly when the data is to be transferred from a buffer on 
the transmitting side to the ports. Each timer sets a bit 
in a transfer request register (TXSCJTREQ) , telling the 
scheduler that the port is ready to accept new data. 



The present invention may work either in a minimum delay 
10 mode or a constant delay mode. The minimum delay mode may 
e.g. be used in cases where the frames from a certain input 
line does not have to arrive at the transmitting side in 
the same order as they arrived at the receiving side. For 
achieving minimum delay, the register values may be 
is configured by means of the following algorithm. 



1. Find the NUMBER value by dividing the ports speed by 
8000. If the port speed is 34,368Mbit/s then NUMBER 
shall be 34.368.000 / 8000 = 4296. This number 
indicates the number of bits transferred each frame. 



20 2. Find the DISTANCE (registers DIST, DISTRS) value by 

dividing 131072 (system clock speed) by the (NUMBER + 
8) . If NUMBER is 4296 then DISTANCE shall be 131072 / 
(4296 + 8) = 30,453531. This value can be represented 
by the binary representation 11101,011101 = 

25 16+8+4+1+0.25+0.125+0.0625+0.015625=30, 453125 which 

is very near 30,453531. The registers DIST and DISTRS 
shall then both have the values 30 (11101) 
(DIST, DISTRS) . 



3. The RXJ3FFSET value should be set to 3 + 2 x N 
30 (N=number of active ports). If e.g. 8 ports are used, 

then RX OFFSET should be set to 3 + 2 x 8= 19 



WO 03/056771 



14 



PCT/NO02/00471 



4. The TXJDFFSET value is given by 131072 - NUMBER x 
DISTANCE - RXJDFFSET - 64-20. Lets say NUMBER=4296, 
DIST=30, DISTRS=30, RX_OFFSET=19 then TX_OFFSET 
should be 131072 - 4296 x 30,453125-19-64-20= 142. 
The numbers 64 and 20 corresponds to time for header 
transfer and time margin, respectively, because of 
asynchronous clock domains. 

In addition to select between minimum delay and constant 
delay for any timeslot, the connection tables at the 
receiving side are used to set up a connection from the 
input ports to the TDM timeslots. The connection table on 
the transmitting side sets up a connection from the TDM 
timeslots to the output ports. The following algorithm may 
be used to configure these tables. 

Find the TDM timeslot as described below. In the address 
field in the connection table of the receiving side 
indicated by the TS_NUMBER (ts) write the value ts. In the 
address field in the connection table of the transmitting 
side indicated by ts, write the value TS_NUMBER (ts) . 

By using the scheduler configuration described above, the 
following can be used to select TDM timeslots for a minimum 
delay connection. 

TS_NUMBER(ts) win = (RX_OFFSET+64+ 6+DI STANCE* ts) *ct_clk (f req) /131072000 
TS_NUMBER(ts) B8X = (TX_OFFSET- 6+DI STANCE* ts ) *ct_clk (f req) /131072000 

With the number found above, and ct__clk frequency =default 
TDM bus frequency we get the following min and max values: 

TS_NUMBER(ts (19+64+6+30, 453125*ts) *3*43, 691/131, 072=99+10, 151042*ts 
TS NUMBER(ts) wax ='(142-6+30,453125*ts)*3*43, 691/131,072=136+10, 151042*ts 
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Where ts is the serial bit stream timeslot number and 
TS_NUMBER is the TDM-bus timeslot number on one of the 
three TDM-buses. TS_NUMBER shall be rounded down to the 
nearest integer value. The ideal selection of TS_NUMBER (ts) 
is in the middle between TS_NUMBER (tsJmin and 
TS_NUMBER (ts ) max . 

The combination of a round robin scheduler connecting 
several data lines with FIFOs to a common memory make it 
possible to dynamically allocate the memory needed for each 
line. This reduces the amount of needed memory. The 
scheduler also reduce the need of large FIFOs since the 
scheduler give each port access to the data memory on a 
regular basis. It also makes it possible to transfer data 
through the node with very short delays. Typical delays for 
a 2Mbit/s in minimum delay mode are 32us against 300us in 
constant delay mode. 

An example of data RAM saving at the receiving side is 
discussed below. 

Consider three requirements: 

1. It shall be possible to connect 32 lines to the 
memory 

2. Each line shall have the possibility to be configured 
for data rates from 64kbit/s to 45Mbit/s 

3. The total capacity for the node need to be at least 
64Mbit/s 

Without a dynamically allocated memory concept about 700 
bytes (needed for 45Mbit/s) of memory for each line would 
have to be allocated to store one frame of data. This means 
32 x 700 bytes = 22.400 bytes per frame. With a dynamically 
allocation scheme we only need to allocate memory according 
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to the total capacity which is about 1.000 bytes per frame 
(needed for 64Mbit/s) . 

The main advantages with the present invention are that it 
provides a better utilization of buffer memory, and that it 
5 requires shorter FIFOs on both sides. 

In addition to reduce the required amount of data storage, 
the scheduler gives each port access to the data memory on 
a regular basis so that only small FIFOs are needed. 

Also, by setting up the scheduler parameters in a special 
way it is possible to obtain very short delays through the 
TDM switch. The total delays through the node may be mainly 
reduced to the FIFO delays. 

■ 

Finally, the present invention allows for both structured 
modus (the bytes in transfer on the time slot buses are 
identifiable) with constant delay and dependent timing, and 
for unstructured modus with minimum delay and both 
independent and dependent timing. This contributes to make 
the invention very useful and unique. 

The present invention is of principal character and 
20 describes streaming of data. The invention will therefore 
be of interest within other areas other than those 
described here. 
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Abbreviation & References 

FIFO First In First Out 



RX 



Receive direction (from local TDM bus to back 
plane TDM bus) 



5 TX Transmit direction (from back plane TDM bus to 

local TDM bus) 



[1] ETO/M/N/A-97:616; Synchronous Digital Switch; 

Reidar Schumann Olsen 



